Consistency of Sequential Bayesian Sampling Policies
Authors
Abstract
We consider Bayesian information collection, in which a measurement policy collects information to support a future decision. This framework includes ranking and selection, continuous global optimization, and many other problems in sequential experimental design. We give a sufficient condition under which measurement policies sample each measurement type infinitely often, ensuring consistency, i.e., that a globally optimal future decision is found in the limit. This condition is useful for verifying consistency of adaptive sequential sampling policies that do not do forced random exploration, making consistency difficult to verify by other means. We demonstrate the use of this sufficient condition by showing consistency of two previously proposed ranking and selection policies: OCBA for linear loss, and the knowledge-gradient policy with independent normal priors. Consistency of the knowledge-gradient policy was shown previously, while the consistency result for OCBA is new.
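As a rough illustration of the kind of policy the abstract refers to, the following sketch simulates a knowledge-gradient policy with independent normal priors, using the commonly stated form of the knowledge-gradient factor for known sampling variance. It is not taken from the paper: the function names, the shared noise variance `noise_var`, the prior variance `prior_var`, the budget, and the true means are all assumptions chosen for the example. The policy uses no forced random exploration, yet in this small run every alternative still gets measured, which is the behavior a consistency result formalizes in the limit.

```python
import math
import random


def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def kg_factors(mu, var, noise_var):
    """Knowledge-gradient factor of each alternative.

    Assumes independent normal beliefs (mean mu[x], variance var[x])
    and a known, common sampling variance noise_var.
    """
    factors = []
    for x in range(len(mu)):
        best_other = max(m for i, m in enumerate(mu) if i != x)
        # Standard deviation of the change in the posterior mean of x
        # caused by one more measurement of x.
        sigma_tilde = var[x] / math.sqrt(var[x] + noise_var)
        z = -abs(mu[x] - best_other) / sigma_tilde
        factors.append(sigma_tilde * (z * normal_cdf(z) + normal_pdf(z)))
    return factors


def run_kg(true_means, noise_var=1.0, prior_var=100.0, budget=200, seed=0):
    """Simulate the policy; all default values are illustrative assumptions."""
    rng = random.Random(seed)
    k = len(true_means)
    mu = [0.0] * k
    var = [prior_var] * k
    counts = [0] * k
    for _ in range(budget):
        factors = kg_factors(mu, var, noise_var)
        # Measure the alternative with the largest factor (no forced exploration).
        x = max(range(k), key=lambda i: factors[i])
        y = rng.gauss(true_means[x], math.sqrt(noise_var))
        # Conjugate normal update of the belief about alternative x.
        precision = 1.0 / var[x] + 1.0 / noise_var
        mu[x] = (mu[x] / var[x] + y / noise_var) / precision
        var[x] = 1.0 / precision
        counts[x] += 1
    return mu, var, counts


if __name__ == "__main__":
    mu, var, counts = run_kg([0.0, 0.5, 1.0, 1.5])
    print("posterior means:", [round(m, 2) for m in mu])
    print("measurement counts:", counts)
```

A finite simulation like this only suggests the behavior; it does not establish that each alternative is sampled infinitely often, which is what the sufficient condition in the paper is meant to verify.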
Similar resources
Asymptotic Optimality of Sequential Sampling Policies for Bayesian Information Collection
We consider adaptive sequential sampling policies in a Bayesian framework. Under the assumptions that the sampling distribution is from an exponential family and that the number of distinct measurement types is finite, we give sufficient conditions for an adaptive sampling policy to achieve asymptotic optimality. Here, asymptotic optimality is understood to mean that the limit of the expected l...
Convergence to Global Optimality with Sequential Bayesian Sampling Policies
We consider Bayesian information collection, in which a measurement policy collects information to support a future decision. This framework includes problems in ranking and selection, reinforcement learning, and continuous global optimization. We give sufficient conditions under which measurement policies achieve asymptotically minimal expected loss. Achieving asymptotically minimal expected l...
AGM-consistency and perfect Bayesian equilibrium. Part II: from PBE to sequential equilibrium
In [6] a general notion of perfect Bayesian equilibrium (PBE) for extensive-form games was introduced and shown to be intermediate between subgame-perfect equilibrium and sequential equilibrium. Besides sequential rationality, the ingredients of the proposed notion are (1) the existence of a plausibility order on the set of histories that rationalizes the given assessment and (2) the notion of ...
Policy Explanation and Model Refinement in Decision-Theoretic Planning
Decision-theoretic systems, such as Markov Decision Processes (MDPs), are used for sequential decision-making under uncertainty. MDPs provide a generic framework that can be applied in various domains to compute optimal policies. This thesis presents techniques that offer explanations of optimal policies for MDPs and then refine decision theoretic models (Bayesian networks and MDPs) based on fe...
Online Bayesian phylogenetic inference: theoretical foundations via Sequential Monte Carlo.
Phylogenetics, the inference of evolutionary trees from molecular sequence data such as DNA, is an enterprise that yields valuable evolutionary understanding of many biological systems. Bayesian phylogenetic algorithms, which approximate a posterior distribution on trees, have become a popular if computationally expensive means of doing phylogenetics. Modern data collection technologies are qui...
Journal title:
- SIAM J. Control and Optimization
Volume 49, Issue
Pages -
Publication date: 2011